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ABSTRACT 

We develop a new formalism for modeling the formation and evolution of galaxies within a 
hierarchical universe. Similarly to standard semi-analytical models we trace galaxies inside 
dark-matter merger-trees which are extracted from a large A^-body simulation. The formalism 
includes treatment of feedback, star-formation, cooling, smooth accretion, gas stripping in 
satellite galaxies, and merger-induced star bursts. However, unlike in other models, each pro- 
cess is assumed to have an efficiency which depends only on the host halo mass and redshift. 
This allows us to describe the various components of the model in a simple and transparent 
way. By allowing the efficiencies to have any value for a given halo mass and redshift, we can 
easily encompass a large range of scenarios. To demonstrate this point, we examine several 
different galaxy formation models, which are all consistent with the observational data. Each 
model is characterized by a different unique feature: cold accretion in low mass haloes, zero 
feedback, stars formed only in merger-induced bursts, and shutdown of star-formation after 
mergers. Using these models we are able to examine the degeneracy inherent in galaxy for- 
mation models, and look for observational data that will help to break this degeneracy. We 
show that the full distribution of star-formation rates in a given stellar mass bin is promising 
in constraining the models. We compare our approach in detail to the semi-analytical model 
of De Lucia & Blaizot. It is shown that our formalism is able to produce a very similar popula- 
tion of galaxies once the same median efficiencies per halo mass and redshift are being used. 
Consequently, our approach may be useful for comparing various published semi-analytical 
models within the same framework. We provide a public version of the model galaxies on our 
web-page, along with a tool for running models with user-defined parameters. Our model is 
able to provide results for a 62.5 /i^^ Mpc box within just a few seconds. 
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1 INTRODUCTION 

The physics of galaxy formation combines a wide range of as- 
trophysical phenomena. On one side it relies heavily on cosmol- 
ogy and on the hierarchical formation of dark matter structures in 
our Universe. On the other hand galaxy formation is greatly af- 
fected by the detailed physics of gas dynamics, star-formation (SF) 
processes, stellar evolution, and black-hole physics. A model that 
will combine all the above physics into a well-defined, coherent 
methodology should thus include a large dynamical range of scales, 
from the formation of individual stars to the formation of the largest 
clusters of galaxies. 

Historically, a detailed understanding of the formation of 
galaxies has grown alongside with th e de velopment of the 
concepts of dark-matter clustering (e.g. [Rees & Ostrike^ 1 19771 ; 
iBlumenthal et al.lll984[white & Frenklll99lh . It became clear that 
galaxy formation is a 'two-stage process', where first dark matter 
clumps collapse and virialize into 'haloes'. This is then followed 
by a second stage of gas cooling into the central regions of these 
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haloes, which can condense and form stars ( IWhite&Reeill978h . 
In the last few decades there have been substantial improvements 
in our quantitative understanding of structure formation of dark- 
matter, both from the analytical point of view jPress & SchechteJ 
1 1974 iBond et al.|[T99ll : ISheth & Tormet]|2002h and also using nu- 
merical simulations (e.g. lUacev & Colelll994l : lNavarro et al.ll 19971 : 
ISDringeletai]|2005l) . The current level of accuracy in modeling 
dark-matter structure growth has reached the level of few percents. 
Some larger uncertainties are still inherent in the halo definition 
and its virial mass, but the way these uncertainties affect galaxies 
is partially related to the virialization of gas inside haloes, and thus 
cannot be fully understood using dark-matter only. 

Unlike the physics of dark matter, the processes that gov- 
ern gas dynamics and SF still remain highly uncertain, usually at 
the level of at least an order of magnitude. For example, cool- 
ing rates may be dramatically affected by the assumed shape of 
the density profiles or di fferent assumptions made in computing 
the cooling function (e.g. 
nif ■ 
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In view of the above difficulties to model galaxies, three 
main methodologies were developed in the literature. The first ap- 
proach is a statistical one, attempting to interpret the data with 
a minimum set of assumptions. This includes variants of the 
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tifying the relationships between galaxies and haloes, the cluster- 
ing properties of both populations, and the properties of satellite 
galaxies. However, by definition, this formalism does not allow to 
follow the evolution of galaxies with time, and to study their forma- 
tion h istories (but see Drory & Alvarez 2008; Conroy & Wechsle^ 
l2009l) . 

A totally different approach is the detailed modeling of in- 
dividual galaxies using high resolution simulations which include 
gas hydrodynamics, and a s imple p rescription s for SF, feedback 
and b lack hole growth (e.g. lAbadi et al. 2003i; IScannapieco et al.] 
l2009h . This approach is valuable for testing specific ingredients of 
the models (i.e. different feedback mechanisms) and it is critical 
to deepen our physical understanding of these processes. However, 
it is still difficult to obtain a statistical sample of galaxies using 
such simulations, o r to span a large ra nge of possible models (see 
a recent attempt bv lSchave et"ai]|2009l) . In addition, processes like 
star formation and supernova (SN) feedback occur far below the 
resolution limits of these simulations, which makes it necessary to 
describe them with analytical and approximate recipes. The finite 
number of particles representing the baryonic content of galaxies, 
and the presence of numerical artifacts make it difficult to find ap- 
propriate recipes which lead to numerical convergence while pro- 
ducing realistic model galaxies. 

The most widely used methodology to interpret observations 
and to provide a coherent picture for the physics of galaxy forma- 
tion is implemented by Semi-Ana lytical Models (hereaft er SAMs). 
The approach was introduced by White & Frenk ('l99lj) and then 
implemented b vlCol e ( 199 1 ); Kauffm ann et al. ( 1993.). Mor e recen t 
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Benson etai] (|2002|); iHatton et alj 120031); iBower 



2007); [Monaco etal 



Cattaneo etal] (l2006h; ICroton et all (|2006|); iDe Lucia & Blaizot 



( l2007h ; lKang et al.l l l2005t) ; ISomerville et al 



20081) . These models provide a detailed picture of galaxy for- 
mation processes and produce a population of simulated galax- 
ies whose global statistical properties resemble the observations 
in many aspects. Consequently, the models can then be used to 
examine specific issues in the formation of galaxies, like the im- 
pact of different proces ses on the shape of the luminosity func- 
tion jBensQ^^j] 2003h . the formation history of e lliptical galax- 



ies (jPe Lucia et al.ll2006l) . and galaxy merger-rates jGuo & Whit3 
bOOsI) . SA Ms are also valua ble for developing new observational 
techniques jChen et al]|2009h . 

The SAM approach is in an intermediate position between the 
detailed hydrodynamical simulations and variants of the halo model 
discussed above. While in contrast to hydrodynamical simulations, 
SAMs include no 3D information on individual galaxies, they fol- 
low the evolution of galaxies within their dark-matter haloes, which 
cannot be done using the 'halo model'. In SAMs, halo merger-trees 
are used as a skeleton for the evolution of galaxies. The first gen- 
eration of models used Monte-Carlo realizations based on the ex- 



tended Press-Schechter approach, while modern SAMs use trees 
extracted from Ai'-body simulations. Recent versions of these mod- 
els include merger-trees which follow the evolution of substructure 
inside haloes. Within these haloes galaxies are assumed to grow, 
with each galaxy being described by a small number of parame- 
ters. This means that only a few mass components are followed 
with time; hot gas, cold gas, stars, black-hole mass, gas which was 
ejected out of the halo, heavy elements and the fraction of mass 
contained in the bulge. SAMs include recipes that are used to com- 
pute the rate of change in each component, according to the prop- 
erties of the galaxy and its host halo. For example, the SF rate is 
usually assumed to be proportional to the mass of cold gas in the 
disk, divided by the disk time-scale. These recipes then enable a fast 
and detailed evolution of galaxies within the model. The benefit of 
SAMs over other approaches is that using a few simple assump- 
tions the model can reproduce the detailed formation histories of 
galaxies. 

Although SAMs have been quite successful in generating 
model galaxies that resemble observations, there are a few limi- 
tations to this method that we would like to address. First, it seems 
that the degeneracy inherent in these models has not been suffi- 
ciently explored. Often a fixed functional form is used to describe 
a given process, and tuning is only done by varying parameters. In 
addition, it is not always tested whether the more complex mod- 
els can be replaced by simple recipes. For example, the physics of 
AGN feedback is poorly understood, but nevertheless implemented 



with a relatively detailed r e cipe in certain SAMs (e.g. ICroton et al 



Bower et al 



2004 [Monaco et alj|2007h . ICattaneo et al] ( |2006|) and 

1 20061) have shown that simply suppressing cooling within massive 
haloes gives results which are very similar to the complex mod- 
els m entioned above (see however the discussion in lPe Lucia et al.l 
l2006l) . This means that the recipe for AGN feedback can be eas- 
ily summarized by a range of halo mass and redshift for which the 
suppression is active. This fact cannot directly be seen from the de- 
tailed equations being used, and the effective range in halo mass 
where this suppression is active is not well examined. 

A different limitation of SAMs is that some recipes are hard- 
wired into a complex simulation code and inter-connected in non- 
trivial ways. It can thus be difficult for users to change the func- 
tional form of these recipes. For example, cooling efficiencies are 
computed using information on the density profile of hot gas within 
haloes, combined with the physics of radiative cooling processes. 
There is no practical way to change the effective cooling efficien- 
cies for ranges in halo mass and redshift, and to test its effect on 
the population of galaxies. In addition, testing a large number of 
very different recipes is nearly impossible, and it can be very hard 
or impossible to tune a SAM in order to match observations if no 
freedom i n the functiona l form s is allowed. For example, as was 
shown by iFontanot et al.l ( |200^ , various SAMs are unable to re- 
produce the decrease of the specific SF rate as a function of stellar 
mass. Up to now, it has been unclear if this limitation is fundamen- 
tal to the SAM approach or not. We will show below that once the 
functional shape of the recipes is relaxed, this observational feature 
can easily be reproduced. 

Another minor disadvantage is that SAM recipes depend oc- 
casionally on a large chain of computational steps, which do not 
allow the reader to extract the actual values being used. For exam- 
ple, it is difficult to estimate the actual cooling and SF efficiencies 
used by a given SAM, which makes it nearly impossible to compare 
different SAMs easily. 

In this work we develop a new approach for following a galaxy 
within a given merger-tree. We present a formalism that is more 
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simple and transparent than the usual implementation of a set of 
recipes, but on the other hand general and flexible enough so that 
model galaxies will reproduce the observational constraints accu- 
rately. We will show that it is possible to simplify the SAM con- 
siderably by only allowing the recipes to depend on halo mass and 
redshift. We find that this simplification does not reduce the level 
of complexity in the modeled galaxies, but it enables a very simple 
and transparent description of the various physical processes. As a 
second step, in order to keep the model general and flexible enough, 
we do not restrict the functional dependence of each process on the 
halo mass and redshift. This generalization greatly facilitates the 
tuning process. The formalism we propose is more compact than 
usual SAMs and can be represented by a few simple functions. 

The main assumption in our formalism is that each physical 
process can be described well enough by its dependence on the host 
halo mass and redshift. Few SAM recipes are already following this 
assumption. For example, the SN feedback, including reheating, 
ejection and reincorporation of disk gas, only depends on the virial 
veloci ty and virial mass of the host halo in iDe Lucia & Blaizoll 
( l2007t) . There are other recipes which are in general not a simple 
function of halo mass and time. For example, the cooling efficiency 
is usually modeled in SAMs by following the density profile of the 
hot gas component, and computing the radius within which gas has 
had time to cool. By extracting cooling effic iencies and star for- 
mation efficiencies directly from the SAM of iDe Lucia & Blaizoj 
( I2OOI hereafter DLB07) we test our main assumption and show 
that it is relatively accurate for all the recipes used in this model. 

In this paper we use our formalism to explore the level of de- 
generacy inherent in current galaxy formation models. We try to 
examine very different models of galaxy formation that are all con- 
sistent with the observational data. We deliberately consider also 
some very extreme and physically less well motivated models in or- 
der to span a large range of solutions allowed by the observations. 
This will help us better to understand the degeneracies involved in 
SAMs. Specific questions can then be addressed by comparing the 
different modeled galaxies. For example, we derive the minimum 
cooling efficiencies which are consistent with the observed popu- 
lation of galaxies; we examine how much SN feedback is required 
in a model with instantaneous cold accretion at low halo masses; 
and we investigate a model in which star formation and cooling 
terminates after major mergers. 

This paper is organized as follows. We start with a review of 
the current SAM ingredients and their uncertainties in section [2] 
In section [3] we present our general formalism and explain where 
it differs from previous works. Our approach is then compared 
with a state-of-the-art SAM in ij4] Section [5] summarizes the var- 
ious ingredients of each specific model we develop here. These 
models are further discussed in ij6] where their results are com- 
pared to observational data. We discuss our results and summarize 
them in [jT] Throughout this paper we use the cosmological model 
adopted by the Millennium simulation with (yi-m, ^h, h, as) = 
(0.25, 0.75, 0.73, 0.9). We use log to designated log^p, t is the 
time in Gyr since the big-bang. 



2 MOTIVATION: UNCERTAINTIES IN SAM 
INGREDIENTS 

In the models presented in this work we change both the absolute 
values and the parameterization of the efficiencies of star forma- 
tion, SN feedback and cooling more freely than what is usually 
done in standard semi-analytical models. We also allow the pre- 



scriptions for dynamical friction and merger-induced star-bursts to 
vary in part of our models. Here we give a brief explanation why 
we believe the standard recipes might be overly restrictive. Al- 
though these recipes may also be modified within standard SAMs, 
we argue that our formalism allows easier and better controlable 
changes. 



2.1 Cooling 

The efficiency with which the hot gas radiates and cools is normally 
not subject to any tuning in SAMs . Cool ing efficiencies are calcu- 
lated accordingjo WM^^^re^ ( 1 199 ih . using cooling functions 
from lSutherland & Dopitj h993h and assuming an isothermal or 
NFW gas density profiltQ. This makes cooling very efficient, which 
is the reason that strong SN and AGN feedback are needed in all 
SAMs. Next to the dark matter physics, cooling rates are probably 
the most fundamental ingredient in SAMs and shape many of the 
secondary ingredients. However, various complications challenge 
the generally used cooling prescription. 

The assumed density profile of the hot gas within haloes can 
greatly affect the cooling efficiencies, as cooling depends strongly 
on the local gas density. Only a few semi-analytical models dif- 
fer from the standa rd prescription b y assuming a different gas 
density profile (e.g. ICole et al]|20QOl) or considering the d ynami- 

How- 



cal adjustment of t he density profile llCattaneo 



lenne thedj 
et alll2006l). 



ever, as shown by [McCarthy et"al] j2008bh and iKaufmann et al.l 



( l2009l) . assuming a cored gas density profile instead of the com- 
monly used steeper profiles greatly decreases cooling rates, and 
thus the need for strong AGN and SN feedback both in clus- 
ters and Milky- Way sized haloes. Various 'preheating' mechanisms 
which may decrease the central density and entropy of the gas ac- 
cording ly have been su ggested. These include: gravitational pan- 
caking jMo e t alj |2005l) ; early quasar feedback ( ILu&Mo 20q3; 
McCarthy et al. '2008b!); feedback produced by an early star burst 
Tang et alJI 20091) ; and the early chaotic assembly phase of haloes 



Keres et al.ll200' 



At gas densities typical of the hot gas haloes around low mass 
galaxies, the assump tion of coUisional ionization equilibrium that is 
used to calculate the Sutherland & Dopit3 j 19931) cooling function s 
may break down l lGnat & Sternberg! I2OO7I ; IWiersma et al.ll2009l) . 



This may decrease cooling rates by roughly an order of magni- 
tude. The effect could even be more dramatic in the presence of 
radiation emanating fro m an AGN or youn g stars residing in the 
galaxy (see the SAM bv lBenson et al.ll2002h . Departing from non- 
solar abundance ratios can also affect coolin g rates by a factor of a 
few fSmith et al. 20081; Iwiersma et al.ll2009l) . 

Finally, as SAMs only assume spherical density gas profiles, 
the rich three-dimensional effects induced by e.g. cold filaments 
cannot be addressed in detail. Several comparisons were made be- 
tween hydrodynamical simulations and the s implified SAM ap- 
proach, mostly find i ng a reasonable agreement (iBenson et al. I2OOII ; 



lYoshida et al.l2002l;llMiv et al.l2003| - |Cattaneo et al.l2007l) . On the 
other hand. IViola et al.l 1 I2OO8I) claim that details in the calculation 
might change cooling rates by an order of magnitude at high red- 
shift. 



^ Note however, that some freedom is left for fixing the age of the halo 
which determines the radius inside which gas is allowed to cool. The age of 
the halo might coiTesponds to the age of the universe, the last major merger 
event, the first time the halo was identified, or the dynamical time of the 
halo. 
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2.2 Star formation 

Most SAMs assume that the efficiency with which cold gas is con- 
verted into stars is inversely proportional to the dynamical time of 
the disk. In some models a critical threshold density of the cold gas 
is used, below which stars are not able to form. These laws replace 
a large amount of complicated physics which cannot be traced in 
detail within SAMs. Modern observational studies of SF in local 
galaxies stress that different components may play a role in reg- 
ulating SF within disks: the atomic gas (Hi), molecular gas (H2, 
found inside gi ant molecular clouds), and stars, which are all inter- 
connected (e.g. lWong & Blitzll2002ljFumagalli et"ai]|2009l) . 

Empirical laws may be derived from low redshift observa- 
tions and can then be used by SAMs to model galaxies at high- 
z as well. SAMs generally make use of the e mpirical Schmidt- 
Kennicutt law HKennicuttll 19981 : ISchmidt|[T95^ . which relates the 
star formation rate to the total cold gas surface density, i.e. the sum 
of the Hi and H2 masses. However , the u ncertainty inherent in 
this law is large llBoissier et al.ll2007l) . Both lWong & Blitj l l2002h 
and lLerovet"al][2008h argue in favour of a Sc hmidt law based on 
the surface density of the molecular gas alone. iLerov et al] ( l2008h 
found that in relation to the total gas density, SF efficiencies can 
vary between 10~^ up to 1 Gyr"'^. Modeling the two phases of 
cold gas seems difficult as it involv es pressure estimates within 
the disk jBlitz & Rosolowskvl l200d). and the format i on ph ysics 
of giant molecular cloud s (but see lObreschkow et al. I 1 I2OO9I) and 
iDutton & van den BoschI j2009l) for recent attempts). Lastly, the 
exi stence of a threshold in gas density for SF that was suggested 
by iKennicuttI l|l998;) is still debated Ischaye 2004; Boissier e t al] 
l2007h . In particular, including such a threshold might not be nec- 
essary if SF is assumed to depend on the molecular gas density 
instead of the total gas d ensity, since the molecular frac tion is sup- 
pressed at low densities jPutton & van den BosclJl2009l) . 

As mentioned above, observations find correlations between 
SF rates and the cold gas surface density and not the total mass of 
cold gas in the disk. This is modeled in most SAMs by computing 
the disk radius and disk dynamical time using the halo spin pa- 
rameter, and assuming a smooth gas distribution. We do not follow 
this method here, which essentially means that we assume some 
average disk radius in a fixed bin of halo mass and time. This 
seems to be a reasonable assumption as the spin parameter shows a 
small dependenc e on halo mass, along with a large general scatter 
teett et alj|2007h . This means that not considering the spin param- 
eter mainly reduce the scatter in the SF efficiency. We will show 
below that this does not change the properties of the model galax- 
ies significantly. 



2.3 SN feedback 

Some sort of feedback by stellar processes is an ingredient in vir- 
tually all semi-analytical and hydrodynamical models of galaxy 
formation. Suggested feedback processes include heating by SN 
or photoionization radiation of massive stars, momentum deposi- 
tion by SN, stellar winds, expanding bubbles of photoionized Hll 
regions, and absorpti on and scattering o f starlight by dust grain 
(see the discussion in iMurrav et al1l2009l , and references therein). 
Most SAMs only attempt to model the feedback by SN, and its 
detailed implementation differs greatly between different models 
and is not well constrained b y observations (see e.g. the discus- 
sion in iDe Lucia et al.l |2004|) . Many SAMs combine two differ- 
ent modes of SN feedback: reheating, in which cold disk gas is 
transferred back into the hot mode with a rate equal to a few 



times the star formation rate; and ejection, in which part of the 
hot or cold gas is assumed to leave the halo, where it only be- 
com es available again for cooling after roughly one dynamical time 
(e.g. iDe Lucia et al.ll2004 ICroton et al. 2006: Bower et al. 200^; 
ISomerville et alj 2008 ). Oth er SAMs do not include any ejection 
(e.g. lKauffmann et al.il993h , or assu me that part of the ejected gas 
can never come bac k to the hot phase l lSomerville & Primac kll999l: 
iBertone et"al]|2007l) . 

The efficiency of reheating and ejection adopted in SAMs is 
mainly determined by the need to reconcile high cooling rates with 
the low number of stars observed. It is only vaguely based on di- 
rect observational constraints of outflows in galaxies. For exam- 
ple, the reheati ng efficiencies used by DLB07 are based on the 
observations by iMartinI i ll 9991) of strongly star-bursting systems, 
which are a population of galaxies with properties at the extreme 
of the distribution. However, there are indications that outflows can 
only be generated above som e limiting star formation density (e.g. 
IStrickland & Heckmanll2009l) and that it is thus not appropriate to 
apply the same recipe to all galaxies. Note also that stellar feedback 
by other sources than SN might scale very differently with galaxy 
properties. 



2.4 Merger-induced SF bursts 

The efficiency of SF in merger induced bursts is defined as the frac- 
tion of cold gas converted into stars during the merger event, after 
subtracting the amount of stars produced in the quiescent mode. 
Numerical simulations which include SF, feedback and coolin g can 
be used to quantify the SF efficiency jCox et aL I l2006ll2008h . Al- 
though the efficiency computed by these simulations usually agrees 
with the standard recipes used in SAMs (Eq. I14t . the dependence 
on additional parameters like the shape of the orbit and bulge mass 
might propagate systematic errors into the SAMs. In addition, the 
galaxies being simulated are generally assumed to be typical low- 
redshift galaxies. Efficiencies at high z are still poorly constrained. 

Numerical simulations indicate that the time-scales for burst 
duration depend strongly on the feedback recipe being used. For 
example, ICox et alj ^20081) examine two different feedback recipes 
which deviate by a factor of '--^ 2 — 5 in the burst time-scale. In 
general, the burst duration in these simulations can vary between 
0.1 and 2 Gyr, and its behaviour at high-z has yet to be investigated. 

In most current SAMs, the SF burst is assumed to start after 
the galaxies have coalesced into one remnant galaxy. This is very 
different from numerical simulations where bursts can be triggered 
at intermediate close passages between galaxies. This effect may 
limit the SAM's ability to use proper time-scales for bursts at the 
accurate time within the merger event. In order to use long time- 
scales which mimic few several small bursts, one may need to de- 
crease the dynamical-friction time, so that the burst timing will be 
accurate. 



2.5 Dynamical friction timescales 

In order to properly model the behaviour of satellite galaxies, all 
SAMs need an analytical prescription for dynamical friction. If 
merger trees from A^-body simulation are used, such a prescrip- 
tion is necessary because subhaloes may fall below the resolution 
limit due to tidal stripping, which means their orbits cannot be 
traced anymore. If FOF merger trees are used, on the other hand, 
it is needed as soon as a galaxy becomes a satellite since no orbital 
information is available thereafter 
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The time it takes galaxies to sinlc into tlie center of the host 
potential well and then merge with the central galaxy is usually es- 
timated by the Chandrasekhar formula (see Eq. ll2b . Recent studies 
are able to estimate the timescale more accurately using high res- 
olution simulations and taking into a ccount detailed o rbital infor- 
mation l lBovlan-Kolchin et alj [20081: 1 Jiang et alj|2008l) . However, 
the orbital information is usually not considered by SAMs. An ad- 
ditional problem arises because it is unclear how the mass of the 
sinking galaxy/subhalo should be estimated; using the mass of the 
subhalo when it was last identified or the baryonic mass of the 
galaxy instead changes the dynamical friction timescales signifi- 
cantly. This can be seen from Eq.[T2]below, where the dependence 
on the satellite mass is strong. 

These problems may bias the estimates of the dynamical 
friction timescale high or low. Indeed, DLB07 had to increase 
their dyna mical friction timescales by a factor of 2 compared to 
ICroton et a l. (2006) in order to fit the observational data. It may 
well be that other models of galaxy formation would require a 
higher correction factor, or one that depends non-trivially on time 
or halo mass. 



2.6 Environmental Effects 

In most SAMs galaxies lose their entire hot gas reservoir as soon as 
they become satellites inside a FOF group, and this hot gas then 
becomes directly available for cooling to central galaxies. This 
leads to satellites which are too red compared to observations (e.g 
IWeinmann etaLll2006l : Isaldrv et allliooa : iKimm et al.ll2009l) and 
may also lead to an overestimate of hot gas that is available to cool 
to the central galaxy. An improved SAM which is more successful 
in reproducing o bservations in this respect has been presented by 
lFontetalJ ( l2008h . 

Environmental effects are subject to large uncertainties, as ef- 
ficiencies of ram-pressure and tidal stripping are not well known 
and de pend on the detailed orbit of the satellite l lMcCarthv et al.l 
l2008ah . Environmental effects are also strongly linked to the 
detailed prescription for SN feedback in satellite galaxies (e.g. 
IOkamotoet"aLll2009l) and the resulting gas density profiles of the 
satellites. Furthermore, it is unclear if stripped satellite gas should 
be made available for cooling to the central galaxy in all cases; 
a large fraction of galaxies commonly identified as satellites in 
FOF groups reside outside the virial radius of the central galaxy 
which makes it unlikely that their gas reservoir is immediately 
added to the gas reservoir of the central galaxy. One promising 
approach seems to be to stri p the diffuse gas aroun d satellites in 
proportion to the dark matter jWeinmann et al .l2009h . Here, we use 
a more simplified method and strip the hot gas around satellites ex- 
ponentially, with an exponential timescale of 4 Gyr. Note that we 
do not include a prescription for the disruption of the cold gas or 
stellar component of the galaxy, which might be important for re- 
producing the corr elation function on small scales or halo occupa- 
tions statistics (e.g. iHenrigues et al.l2008l ; lKim et al.ll2009h . Such a 
recipe could however easily be incorporated into our methodology. 



3 THE FORMALISM 

In this section we describe in detail the formalism we develop for 
modeling the formation and evolution of galaxies. This formal- 



ism is very general and is not restricted to a given modej^. Con- 
sequently, we do not describe in this section the functional shape of 
each process, or the parameters being used. The motivation in set- 
ting up this approach is to have a methodology which is conceptu- 
ally simple, includes most of the physical processes, and is general 
enough so that very different models can be described within the 
same language. 

3.1 Merger trees 

We use merger trees extracted from the Millennium A'^- 
body simulation dSpringel et akl l2005h . This simulation was 
run using the cosmological parameters (Sim, S^a, h, erg) = 
(0.25, 0.75, 0.73, 0.9), with a particle mass of 8.6 x 10** h"'^ Mq 
and a box size of 500 /i~^Mpc. The merger trees used here 
are based on siibhal oes identified using the SUBFIND algorithm 
jSpringel etalj200ljK They are define d as the bound density peaks 
inside FOF groups jPavis et al.lll985h . More detail s on the simu- 
la tion and the subhalo merger- trees can be found in lSpringel et al.l 
(i20Qi) and lCroton et~ai] ( l2006h . The mass of each subhalo (referred 
to as Mh in what follows) is determined according to the number 
of particles it contains. Within each FOF group the most massive 
subhalo is termed the central subhalo of this group. Throughout 
this paper we will use the term 'haloes' for both subhaloes and the 
central (sub)halo of FOF groups. In general, our approach could 
also be based on FOF merger-trees, which might be generated by 
Monte-Carlo realizations. 

3.2 Quiescent evolution 

Each galaxy is modeled by a 3-component vector, 

mcold , (1) 

mhot / 

where mstar is the mass of stars, mcoid is the mass of cold gas, and 
mhot is the mass of hot gas distributed within the host halo. We 
use the term 'quiescent evolution' to mark all the evolutionary pro- 
cesses of a galaxy, except those related to mergers. The quiescent 
contribution due to each physical process to the components of rii 
is designated by square brackets. For example, ["^coid] feedback 
the change in the mass of the cold gas due to feedback. 

Fresh supply of gas into the galaxy is provided by smooth ac- 
cretion along with the growth of the dark-matter halo mass. The 
rate of cold and hot accreted gas is modeled by 

[mcold]^c„.ctio„ = /ca ■ Mh (2) 
[mhot]^^„<,tio„ = ./ha ■ Mh . (3) 

Here Mh is the rate of dark-matter smooth accretion which does 
not include mergers with resolved progenitors (if Mh < we use a 
gas accretion rate of zero). The fractions /ca and /ha are assumed to 
be some functions of the halo mass and redshift, with the standard 
values being and 0.17 respectively (note that the sum /ca + /ha 
corresponds to the cosmic baryon fraction in all the models pre- 
sented in this paper). 

The mass of cold gas may increase due to radiative cooling of 
the hot gas. This cooling rate is modeled by 

^ We use the term 'formalism' to describe the general methodology we 
use, specific solutions that can be applied within this framework are called 
'models' 
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cooling 



/c ■ mhot ■ 



(4) 



The cooling efficiency, = fc{Mh,t), is a function of the host 
halo mass A4h and the cosmic time t only, and given in units of 
Gyr-^ This is a significant simplification over other approaches, 
neglecting the additional dependence on the gas metallicity and 
density, which is only implicitly included within the halo mass de- 
pendence. We will examine the difference between our approach 
and the usual SAM recipe in section|4]below. 

We assume that the SF rate is proportional to the amount of 
cold gas. 



"Icold 



fs ■ rricoid , 



(5) 



where fs = fs{Mh,t) is a function of the halo mass and time, 
in units of Gyr^^. For each SF episode we assume that a constant 
fraction of the mass is returned back to the cold gas component 
due to SN events and stellar winds. This recycling is assumed to be 
instantaneous, and contributes 

[w.cold]jet.yt,iing = — [^"'-star] re(.y(.ii„g = R [m,star]gp • (6) 

Here _R is a constant with typical value s in the range 0.4 — 0.7 . 
Stellar population synthesis models (e.g. lSruzual & CharloJI 20031) 
predict a recycling time of approximately 1 Gyr. As a result, our 
assumption of instantaneous recycling might change the cold gas 
fractions inside galaxies, and consequently the values of SF rates. 
We plan to test this assumption in a future work. 

Gas can be heated due to SN explosions and move from the 
cold gas into the hot. Assuming that SN events immediately follow 
star formation, this feedback should be in proportion to the SF rate, 

[^■hot] feedback ~ ~ ["^^old] feedback ~ C^) 
/d [m,star]gp = /d/sTTlcold . 

Here feedback is modeled by a function of halo mass and time /d = 
fdiMhyt). Other feedback mechanisms like feedback by AGN are 
not in general proportional to the SF rates, and are not explicitely 
modelled in our approach (although may be regarded as implicitely 
included in the net cooling rates). Note that we assume feedback is 
proportional to the value of SF rate before recycling is considered. 
We allow some of the gas to be ejected from the hot gas phase out 
of the halo: 



tion — /<= ['^starjgp — /c/sWlcold ; 



(8) 



where again the efficiency /e is a function of halo mass and time 
only. This gas is never allowed to fall back, so the baryonic mass 
inside haloes might be lower than the universal average. 

To conclude, each process is described by one function which 
depends on the host halo mass and time only. All processes dis- 
cussed in this section can be written in a compact form by using 
the following differential equations: 



m = Am + BMh 



where 

A = 

V 

( ° 

B = /ca 

V /ha 



(1 - R)fs 

-(1 - R)fs - /d/s 
/d/s - /c/s 





/c 



(9) 



(10) 



(11) 



We emphasize that A and B do not depend on m, resulting in a set 
of linear inhomogeneous differential equations. 



Photoionization heating of the intergalactic medium is as- 
sumed to suppress the amount of cold gas available for SF within 
low mass haloes. This effect is critical for modeling the formation 
of dwarf galaxies. The minimum halo mass of ~ 2 x 10^" H^^Mq 
in the Millennium simulation, which we use here, does not allow a 
detailed modeling of small mass galaxies. Consequently, we use the 
simple assumption that all the gas is kept hot until redshift 7 when 
cooling and SF processes can start. This information is included in 
the components of A and B. 



3.3 Mergers and satellite galaxies 

Satellite galaxies are defined as all galaxies inside a a FOF group 
except the main galaxy inside the central (most massive) (sub)halo. 
Once the (sub)halo con'esponding to a given galaxy cannot be re- 
solved anymore, it is considered as having merged with the central 
halo. Due to the effect of dynamical friction, the galaxy is then as- 
sumed to spiral towards the center of the FOF group and merge with 
the galaxy in the central halo after a significant delay time. 

At the last time the dark matter (sub)halo of a satellite galaxy 
is resolved we compute its distance from the central halo (r-gat), 
and estimate the dy namical friction time using the formula of 



and estimate tne ay namica 
iBinnev & Tremaind ( fT987l) . 



tdf = OlM 



Gmsat In (1 -I- Mh/nis; 



(12) 



For msat we use the baryonic (stars -I- cold gas) mass of the satellite 
galaxy plus the minimum subhalo mass which can be resolved by 
the Millennium simulation. Vv , Mh are the virial velocity and mass 
of the central subhalo. We add a free parameter Odf to this equation 
in order to reflect the uncertainties discussed in section |23] If a 
satellite falls into a larger halo together with its central galaxy we 
update tdi for both objects according to the new central galaxy. 

Given the simplifications used in the previous sections, one 
might argue that tdi should be formulated a function of halo mass 
and redshift only as well, without assuming any a-priori parameter- 
ization. However, it is very likely that such a function should also 
depend on the satellite mass, msat, resulting in at least three vari- 
ables for this function. This variable space might be too large for 
finding useful constraints. In addition, the level of accuracy in dy- 
namical friction estimates is becoming high enough, and will leave 
less room for freedom in the near future. 

While satellite galaxies move within their FOF group, they 
suffer from mass loss due to tidal stripping. The hot gas halo of 
the satellite should be the first baryonic component that will be 
stripped. We assume that all satellite galaxies are losing their reser- 
voir of hot gas exponentially, on a time scale of a few Gyr. In order 
to properly model this stripping we modify A by subtracting a con- 
stant a;j from one of its elements: 



Asat(3, 3) = — /c — aji , 



(13) 



Note that a constant in the diagonal of A gives an exponential time 
dependence. However, the actual dependence of rrihot on time for 
satellite galaxies is more complicated due to contributions from 
feedback and cooling. In general the parameter should depend 
on the dynamical time of the host halo. For simplicity we consider 
it to be a constant here. 

When galaxies finally merge we assume t hat a SF burst is 
triggered. We foll ow Mihos & Hemguis^ ( ll994 ): [Somerville et al.l 
( l200lh : ICox et alj ( |2008|) and model the amount of stars produced 
by 
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Arristar = Olbi ^ 1 (nil, cold + ^2, cold J 

V ma / 



(14) 



Here rrii are the baryonic masses of the progenitor galaxies (cold 
gas plus stars), mi, cold is their cold gas mass, and ab, are con- 
stants. 

The efficiency is computed before applying the recycling fac- 
tor R, so the net amount of stars added to the galaxy is Amstar ( 1 — 
-R). SF in bursts are accompanied by the same SN feedback as 
used for the quiescent mode of SF. The burst duration (Tburst, rnay 
vary between tens of Myr to a few Gyr depending on the merger 
mass ratio, and whether mu ltiple bursts are considered or just the 
main peak dCox et alj|2008h . We will specify below the time- scale 
adopted for each model. 



3.4 Usage: degrees of freedom, tuning, code 

The number of degrees of freedom available for tuning is much 
larger here than in any other SAM which assumes some fixed func- 
tional form for the recipes used. The quiescent evolution part in- 
cludes six functions, /c, /d, /a, /e, /ca,, /ha whcrc each may 
have any dependence on halo mass and time. There are additional 
few constants: R, Ofdf, Qh, ib, Qc which are mostly related 
to mergers and satellite treatment. The large freedom inherent in 
the formalism allows us to incorporate very different models, but it 
demands some starting point when tuning the model against obser- 
vations. As will be demonstrated below, for each model developed 
here most of the free parameters are fixed a-priori and are not tuned. 
Once a set of such assumptions is made we can explore the freedom 
in one or two components, and try to match the observational data. 

Our code is available for public usage through the internet (see 
http : / /www .mpa-garching . mpg . de /gal form/ sesam). 
Users are allowed to use any functional shapes for the quiescent 
ingredients by loading tables of efficiency values as a function of 
time and halo mass. Alternatively, simple power-law functions can 
be used. The current public version applies the model to a small 
simulation volum^ and provides results in just a few seconds. It 
also automatically compares the model results to observation. 

In order to solve Eqs.[9]-[TT]we divide the time-step between 
each simulation snapshot (which is roughly 250 Myr in the Millen- 
nium simulation used here) into 20 smaller steps (~ 12 Myr each). 
In each such small step we compute the incremental addition to the 
mass of stars, cold gas, and hot gas. We checked that using different 
number of time-divisions (5-100) does not change the results of our 
models significantly. This means that at any time-step the change 
in the masses of the galaxy components is small, and the models 
are numerically converged. An error message will appear in case 
the recipes for quiescent evolution produce negative masses. In this 
case, either the time-step, which is a tunable parameter, has to be 
increased, or the recipes have to be adapted. We found that conver- 
gence is achieved once the efficiencies values are smaller than one 
over the time-step size. 

Unlike in the quiescent evolution, negative masses are occa- 
sionally produced by our recipes describing star-bursts. Our mod- 
els usually assume that these bursts occur rapidly on time-scales of 
few tens of Myr. As a result, the feedback efficiency might be so 
high that it would be able to heat up more cold gas than is actually 
present within the galaxy. In these cases we simply limit the amount 



^ We use the Milli-Millennium simulation, which was run with the same 
parameters as the full Millenni um simulation, but within a box size of 62.5 
Mpc h.-l )Lemson et ai]|2006h . 



of heated gas to the amount of cold gas available after producing 
the stars in the burst. 



4 COMPARISON TO A FULL SAM 

In this section we compare our formalism to a full semi-analytical 
model (SAM). We examine how the results of the SAM change if 
the recipes are simplified to depend only on halo mass and redshift. 
The resulting change in the properties of the simulated galaxies will 
hint at the level of complexity that can be achieved by our approach. 
This will also indicate if our formalism can serve as a common 
language to describe other SAMs published in the literature. 

We choose to analyze as a reference the SAM developed by 
DLB07. This makes the comparison simple because the same set 
of merger-trees are being used, and because the results of DLB07 
are publicly available. We introduce a model which follows the for- 
malism introduced in section [3] and resembles DLB07. It will be 
named 'Model 0' hereafter. Our general approach is different from 
the DLB07 model mainly in parts related to the quiescent evolu- 
tion. Other components of Model which are related to mergers 
and satellite galaxies can be easily matched to resemble the details 
of the DLB07 model. We list these minor adjustments below: 

• Dynamical friction: A few modifications to Eq.[T2]are made in 
order to closely follow DLB07: Within Model 0, msat is the dark- 
matter halo mass of the satellite galaxy at the last instance when it 
was still resolved instead of the baryonic mass of the satellite used 
in other models in ij5]below. We also do not allow rsat to be larger 
than the virial radius of the central galaxy and we use Odf = 2. 

• Satellite stripping: we set a.^ from Eq.[T3]to a very large num- 
ber. This makes galaxies lose their hot gas to the central galaxy 
within the FOF group immediately as they enter the group. 

• Merger-induced SF bursts: once the galaxies merge, the ef- 
ficiency of SF bursts is implemented according to Eq. [14] with 
Qfc = 0.56 and Qc = 0.7. 

We do not modify our implementation of reionization in order to 
mimic DLB07 accurately, but keep it as described in our section[3] 
We verified that changing the details of this recipe does not affect 
the results that we will show below. 



4.1 Model 0: ingredients of the quiescent evolution 

A galaxy within the DLB07 model is assumed to have four main 
components: hot gas, cold gas, stars, and ejected gas. The ejected 
gas is assumed to have been ejected out of the hot gas halo due to 
the energy input by SN feedback. It will only be re-incorporated 
into the hot gas (and be susceptible to cooling again) after a cer- 
tain delay time. The inclusion of such a phase thus effectively de- 
creases the amount of gas that can cool to the galaxy. The ejected 
gas component is not a part of our formalism as presented in section 
[3]because it will not be used in the new models we will develop be- 
low. However, we can easily add it here and change the number of 
components in the model. This is implemented by replacing Eq.[T] 
with: 



va.o 



( mstar \ 
mcold 

mhot 
\ mcjct 



(15) 
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Figure 1. The full distribution of efficiencies for a given halo mass and 
time, as extracted from the DLB07 model. We plot the SF efficiency (/s) in 
solid line for galaxies identified at ; = 0. /c is plotted with dotted (dashed) 
line for i = 13 (7) Gyr after the big-bang. All galaxies are central inside 
their FOF groups, with host subhalo mass which equals 10^^ h^^ M^.The 
distribution of cooling efficiency is bi-modal (see text). 



We also need to increase the number of differential equations: 



An = 



/ 



V 



(1 - R}fs 

-{1-R)fs-Ufs 
/d/s - /c/s 
/o/b 



(16) 





/c 
-/re / 

Here /rc is the efficiency for re-incorporation of gas from the 
ejected phase back to the hot gas within the halo. We use 



Br 



I 





0.17 




(17) 



which means that all the fresh gas that is accreted along with the 
smooth dark-matter is added to the hot phase first. 

Some of the ingredients of Ad are already given by DLB07 
simply as functions of halo mass and time. These include the feed- 
back efficiency /d = 3.5; a s well as ejec t ion ef ficiencies (/o and 
/re) which are described by ICroton etal] | |2006|) , Eqs. 20 & 2fl 
On the other hand, the cooling (/c) and SF (/s) efficiencies are 
complicated and depend on various extra parameters. In order to 
replace these recipes with functions of halo mass and time we run 
the DLB07 model over the Milli-Millennium simulation. We record 
SF and cooling efficiencies at each time-step for all galaxies which 
are central within their FOF group. This ensures that our cooling 
is not affected by the specific treatment of satellite galaxies. Me- 
dian values are then computed for bins of halo mass and redshift. 
We describe below some details of these recipes, and the values we 
adopt. 

The cooling efficiency in the DLB07 model is computed ac- 
cording to the growth of the 'cooling radius', which describes the 
maximum radius at which the hot gas density is still high enough 
for the cooling to occur within a halo dynamical time. This radius 

4 We found that /o = 21 .A&M~^-^''^t'^'^^ - 3.5 and /rc = 2.86t-''-327 
Here Afio is the subhalo mass in units of 10^" h^^ Mq and t is in Gyr 



is determined using the effi ciency of radiative cooling as given by 
ISutherland & Dopitd ( [T993h . which depends also on the metallic- 
ity of the hot gas. Moreover, as the density profile of the hot gas is 
assumed to adjust itself instantaneously to the total amount of hot 
gas, cooling rates goes like fcmli^^ and not like /cmhot as assumed 
here. As a result, cooling efficiencies are not simply a function of 
halo mass and time. In addition, AGN feedback can decrease the 
cooling efficiency in the DLB07 model, and practically shuts off 
all cooling for high mass haloes. 

In Fig. [6] we plot median values of cooling efficiencies from 
the DLB07 model according to the procedure outlined above. For 
each galaxy we first average the efficiency values within one snap- 
shot, going over 20 time-steps which are used to integrate the model 
equations. As seen from the figure, gas is not allowed to cool in 
haloes with masses above ~ 10^^ h'^MQ. This is due to AGN 
feedback implemented by DLB07. For lower halo masses the cool- 
ing efficiency is almost constant with halo mass, and increases dra- 
matically only at early times (z > 1, t < 7 Gyr). In order to mimic 
accurately these cooling efficiencies we take the specific numerical 
values shown in Fig. |6] and use them to define /c in our model. 
These values are also given explicitly in Appendix A. We have 
checked the dependence of cooling on gas metallicity in DLB07 
and found it to be small for a given halo mass and redshift. 

Efficiencies shown in Fig.[6]are median values computed from 
a large statistical sample of galaxies. It is interesting to exam- 
ine the full distribution of /c, for a given bin of time and mass. 
In Fig. [T] we show the distribution of cooling efficiencies from 
DLB07, for haloes of mass ^ 10^^ /i^^Mq. Surprisingly, the 
cooling efficiency is bi-modal, showing two distinct peaks, even 
though the halo mass is within the 'fa st cooling' regime, where 
cooling should be highly effective (see iBirnboim & Dekell [20031 : 
ICroton et alj|2006l) . We explored this feature in detail, and found 
that the bi-modality comes from the evolution of the cooling radius 
with time. All galaxies that live inside small mass haloes experi- 
ence periods of fast and slow cooling rates within a few time-steps 
(the fast cooling episode usually last one time-step, ~10 Myr). This 
is due to the fact that the cooling radius is proportional to the square 
root of the mass of the hot gas, and gas is assumed to be distributed 
instantaneously within the halo (see lCroton et al.ll2006l Eq. 3 & 4). 

Both DLB07 and our model assume that the SF is proportional 
to the mass of cold gas in the galaxy. However, DLB07 uses an 
estimate of the disk radius which depends on the spin parameter 
of the halo, and cannot be represented by a simple dependence on 
halo mass and time. In addition, DLB07 incorporate a threshold 
mass of cold gas, below which stars cannot form. This threshold 
mass is defined by DLB07 to be a function of halo mass and time 
only. Consequently, in this section we will use a modified SF law, 
namely 



SFe 



"Istar = /s,D ■ (Tlcold 



(18) 



Using the above method to record cooling and SF efficiencies we 
found that the median SF efficiency used by DLB07 is accurately 
described by an analytic fitting function: 



/s,D 



10 



-6.5-0.0394[log J\/fel^ 



(19) 



It should be noted that the two contributions from Mh almost can- 
cel each other out, which means that the overall dependence on Alh 
is small, as can be seen in Fig. [7] In Fig. [T] we show the full distri- 
bution of SF efficiencies for a given bin of mass and time. This 
distribution is relatively narrow, and does not show any significant 
features. 
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Figure 2. Stellar mass functions for Model (solid) compared to DLB07 
(dashed). Both samples ai'e based on the Milli-Millennium sim ulation. The 
thick gray line at z = shows the observational results of iLi & Whitel 
For each different redshift we multiply the y-axis by a constant for 

clarity. 



Figure 3. Specific SF rate (SSFR) versus stellar mass for various redshifts 
as indicated. Model galaxies are plotted in solid lines, DLB07 galaxies 
are plotted in dashed lines. Contour levels are at 0.01 and 0.1, when the 
normalization is such that the integral over all values gives unity. A mini- 
mum SSFR of 10^* Gyr^^ is used for all galaxies in this plot. 



The threshold mass used by DLB07 can be parameterized as 
a function of halo mass and time by 

™ ^-1 in-8-61 ii,r0.68 ,-0.515 

"Icrit = ,/s,D 10 A'i,, t . (20) 

Here t is the time since the big-bang in Gyr, and Mh is the subhalo 
mass in units of h~^MQ. We use a recycling factor, R = 0.43, as 
is done in DLB07. 

4.2 Model 0: simulated galaxies 

Our model is based on a completely different numerical code than 
the model of DLB07. In order to perform the comparison below we 
applied our Model to the same merger-trees as in DLB07 (i.e. the 
Milli-Millennium simulation, with box size of 62.5 Mpc h~^). We 
use recipes according to our basic formalism presented in section 
[3] with details as explained in the previous sections. We note that 
occasionally the mass of a component in a galaxy gets negative 
when applying the recipes above. In these cases we set the negative 
mass to zero as is done in DLB07. 

We compare stellar mass functions from the original DLB07 
to the galaxies in our simplified model in Fig.|2] For most masses 
and redshifts the stellar mass functions are very similar. One no- 
ticeable difference can be seen around 10^^ Mq, where our model 
has a peak at all redshifts. This peak might be related to the fact 
that we use the median value of the cooling efficiency to represent 
a complex distribution. 

In Fig. [3] we compare specific SF rates for galaxies in our 
model to those from the DLB07 model. The results are very sim- 
ilar. Interestingly, the scatter in SF rate for a given stellar mass is 
nicely reproduced by our model. This shows that most of the scat- 
ter in the properties of galaxies is not introduced by the scatter in 
the efficiencies of the underlying processes, but instead caused by 
the large variations in the formation histories of dark- matter haloes. 
The fraction of passive galaxies is shown in Fig.|4]and also follows 
DLB07 reasonably well. We note that the results of SF rates im- 
prove for higher redshifts. Lastly, we show in Fig. |5] the average 
fraction of cold gas in Model versus DLB07. The agreement is 



1 
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Figure 4. Passive fraction of galaxies at z = 0. Solid line shows Model 
galaxies, dashed line shows DLB07. Passive galaxies are defined to have 
SSFR lower than the dashed line in Fig. [TT] panel I, which is also defined 
in E g. 1 25 1 Observationa l data is plotted in shaded g ray region and is taken 
from lsalim eTal] j2007h : ISchiminovich et all j2007l) . 

again reasonable, with the exception of the feature visible at a stel- 
lar mass of 10^^ Mq. This feature is likely related to the feature in 
the stellar mass function discussed above. 

All the results plotted here show that our model galaxies are 
very similar to the galaxies produced by DLB07. Apparently, using 
efficiencies which depend on halo mass and time does not seem 
to reduce the level of complexity of the model. There are a few 
deviations between the models, which is not unexpected given the 
extent to which we have simplified DLB07. Note that it is likely that 
our simplified model could be brought to even closer agreement 
with DLB07 if we allowed some freedom in choosing cooling rates, 
i.e. use cooling rates which deviate slightly from the median of 
the distribution. Indeed, in section |5] we show that we can obtain 
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Figure 5. Average gas fractions, mcoid/^star, for Model (solid) and 
DLB07 (dashed) galaxies. We plot the log of the average over all galaxies 
at redshift zero, for a given stellar mass. 



smooth stellar mass functions when using a single value for the 
cooling efficiency in each stellar mass and time bin; the features 
seen in the stellar mass function and cold gas fraction at lO'^^ Mq 
are thus not related to the fact that we do not use a bimodal cooling 
efficiency like DLB07 do. 



5 THE MODELS 

We use the formalism presented in section [3] and develop specific 
models that will be able to produce galaxies which agree well with 
the observational data. In this section, we present five models that 
represent a large range of scenarios. We discuss the motivation for 
each model, and the various ingredients being used. We compare 
the results of the models to observation in section [6] It should be 
noted that our tuning is not meant to be perfect, and it might be that 
other models similar to ours produce galaxies which fit observa- 
tions in a better way. A summary of all the models developed here 
can be found in Table [T] 

5.1 Default ingredients 

We list below the default ingredients of the models. These are com- 
mon to most of the models discussed in this work. Deviations from 
these basic ingredients will be addressed specifically for individual 
models. 

• Hot gas that is being stripped from satellite galaxies is lost to 
the intergalactic medium and is not added to the hot gas reservoir of 
the central galaxy. Although other SAMs usually add the stripped 
gas to the central galaxy, it might be that tidal interactions will eject 
some of this gas out of the halo. Also, this effect could be counter- 
balanced by changing cooling rates for high-mass haloes. We use 
ah = 0.25 Gyr^^ which mimics a stripping with an exponential 
time-scale of 4 Gyr for all satellite galaxies. 

• The dynamical friction time t^i from Eq.[T2]is multiplied by 
a factor of ctdi ~ 3. 

• The efficiency of merger-induced SF bursts is given by at = 
0.56, Oc = 0.7 as suggested by DLB07. The burst duration, 
Cburst, is assumed to be very short, at the level of 10 Myr. 




Figure 6. Cooling efficiencies used in our various models. The gray scale 
shows Log values of fc in units of Log[Gyr^ ^ ] . X-axis represents the time 
in Gyr since the big-bang, y-axis is the host halo mass. Each panel is devoted 
to a different model as indicated (see summary of all the models in table[T). 
For haloes more massive than shown here we use the same efficiencies as 
for the most massive bin. Cooling e fficiencies in the bottom rig ht panel 
were obtained by running the SAM of toe Lucia & Blaizoll j2QQ7h and are 
discussed in section |4] We use a bilinear interpolation scheme to obtain 
logarithm values for intermediate masses and times. Specific values for the 
efficiencies plotted here can be found in Appendix A. 



• The ejection efficiency, fc, is set to zero. 

• We use a recycling factor of R — 0.5. This value is ob- 
tained using the iBruzual & Chariot 1 20031) stellar population syn- 
thesis model, adopting a lChabried ( l2003h IMF. 

• All gas is accreted to the hot gas phase, so /ca = 0, /ha ~ 
0.17. 



5.2 Model I - zero SN feedback 



In section [ZT] we argue that the ef ficiency of cooling mig ht be dif- 
ferent from the values predicted bv lWhite & FrenkI l ll99I ). Here we 
would like to test a scenario in which this efficiency has its mini- 
mum possible value. It will be interesting to have a constraint on 
the minimum cooling rates from observations. We can then exam- 
ine if these values fall within the range of uncertainties discussed 
in section l2n 

A main degeneracy in modeling the evolution of galaxies is 
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Table 1. A summary of the new models discussed in this work. The models are presented in section|5] and in Figs.|6]&|2] The results are summarized in section 
[6] Figs.[8l- [T5] Components which are considered as free parameters, and used to tune the model against observations are written on a gray background. 
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that cooling can be balanced by feedback processes which heat up 
the cold gas. Thus, a minimum cooling scenario must be achieved 
when the feedback efficiency is set to zero, while observations are 
still reproduced. In this case, the total mass in cold gas and stars 
depends only on the cooling efficiency. Consequently, if the total 
mass of stars and cold gas is known for a statistical sample of galax- 
ies, we can define the minimum cooling rates in a relatively unique 
way. Unfortunately, we only have observational estimates on cold 
gas fractions at very low redshifts. Nevertheless, we find that those 
are already a significant constraint, as they limit the sum of cooling 
efficiencies over all times. We set the efficiency of merger-induced 
bursts to zero, in order to further increase the degree of uniqueness 
in this problem. 

We tune the cooling and SF efficiencies iteratively. Once we 
have assumed certain cooling efficiencies, we tune the SF efficien- 
cies to get the proper stellar mass functions. If the resulting amount 
of cold gas differs from observations we then change the cooling ef- 
ficiencies again, and re-tune SF accordingly. Only a few iterations 
are needed in order to converge to the solution given below. 

The cooling and SF efficiencies of this model are shown in 
Figs.[6]&|7] As expected, cooling efficiencies in this model need 
to be very low, and are roughly 1-3 orders of magnitude smaller 
than the standard values adopted by the usual SAMs (values from 
the SAM of DLB07 are shown in the same figure). Such a low effi- 
ciency might be explained by substantial preheating. For high red- 
shift and massive haloes (z > 1, Ah > 10^^ h'^ Mq) the cooling 
efficiencies used in Model I are only roughly an order of magnitude 
lower than the standard ones, which might be a plausible difference 
relative to the standard cooling rates even in the absence of preheat- 
ing. An additional characteristic of cooling efficiencies is that they 
are peaked at a halo mass of ~ 6 X 10" h-^KlQ at all redshifts. 

Interestingly, we can see in Fig. [7] that our zero feedback 
model requires a star formation efficiency much higher than in 
DLB07 at high redshifts. This is necessary in order to obtain rea- 
sonable cold gas mass fractions at z = 0: If star formation efficien- 
cies were brought down, this would have to be counterbalanced by 
higher cooling rates at high redshift, which would in turn lead to 
too high gas fractions at z = 0. 




t [Gyr] 

Figure 7. SF efficiencies values (/s) used in our various models in units 
of Log[ Gyr^^ ]. This figure is similar to Fig.|6] The functional shape of 
DLB07 is given in Eq. 1191 Specific values used by Model 1 are given in 
Appendix A. 



5.3 Model II - standard 

In Model II we adopt feedback and SF efficiencies which are sim- 
ilar to that of DLB07 (see section |4ll. However, we differ from 
DLB07 in three respects: (i) we do not include an ejected phase, 
(ii) we do not include threshold for SF, and (iii) the hot gas of satel- 
lite is stripped exponentially with a relatively long time-scale of 4 
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Gyr. We then look for cooling efficiencies which produce galaxies 
that fit the observational data given the above assumptions. 

As in Model I, cooling rates within Model II have to be low in 
order to reproduce observations. These are shown in Fig. [6] Com- 
pared to DLB07 cooling rates are lower by up to two orders of 
magnitudes (instead of the three order of magnitudes in Model I). 
This means that the mechanism of ejection actually has a stronger 
impact than the pure 'reheating' by supernova feedback; reheating 
by supernova seems to allow to increase cooling rates by one order 
of magnitude; ejection by two orders of magnitude. 



5.4 Model III - cold accretion 

In Model III we include a prescr iption for cold a c cretio n 
which follows the ideas described by Birnboim & Dekel 1 2003h. 



which lollows t he ide a s described b y Hjirnboim & UekeJI liUUJI). 
ICattaneo et al] ( |2006|) . iKeres et alj ( |2009|) and iKhochfar & SiM 
1 20091) . In order to model the infall of cold gas into the disk, we 
assume that mhot represents the mass within cold filaments flow- 
ing with virial velocity into the disk. In this case, the rate of tran- 
sition from TTihot to rricoid is fixed by the infall time, which we 
approximate with the dynamical time of the halo. Consequently, 
the 'cooling' efficiency is now: 



/c.max — l/'^dyn 



0.15t 



(21) 



This sets the rate at which gas that joins the halo reaches the disk 
and becomes available for star formation. This is also the maxi- 
mum cooling efficiency that can be used. From Fig.[6]it can be seen 
that the cooling efficiencies used by DLB07 in the "cold accretion 
regime" are slightly lower (usually by 0.2 dex, 60%) than the max- 
imum efficiency /c,max defined here^ 

We roughly adopt the results o f lKeres et alj l l2009l) which indi- 
cate that cold accretion is dominant at halo masses below few times 
10" h~^MQ, and use maximum cooling efficiencies below a mass 
of ~ (1 +z) X 10^^ H^^Mq as can be seen from Fig.[6] We use the 
same SF efficiency as in DLB07 (see section|4]and Fig.|7]l, except 
that we set the SF efficiency to zero for haloes more massive than 
3 X 10^^ h~^MQ at redshifts smaller than 0.77, in order to pro- 
duce the proper number of passive and massive galaxies. This may 
mimic an AGN feedback in massive haloes which heats up all the 
cold gas in the disk, and does not allow any additional accretion of 
cold gas coming from satellite galaxies. 

Both cooling and SF efficiencies are thus set by our prelimi- 
nary assumptions. We are now looking for a SN feedback efficiency 
that will make this model successful in reproducing the population 
of galaxies. In order not to confuse the heated gas due to feedback 
with rfihot which is actually the filament mass here, we use the 
ejection efficiency /c and remove all the heated gas from the halo. 
We find that a simple power-law gives reasonable results: 



/c = .fd = 10' • Mro 



(22) 



where Mio is the subhalo mass Mh in units of 10^° h~^MQ. We 
limit this efficiency to a maximum value of a 100 Gyr~^ in order 
for the code to converge within 20 time-steps, as discussed at the 
end of section|3.4| 



5.5 Model IV - only bursts 

The star formation histories of galaxies are a combination of short 
episodes of bursts and long term quiescent evolution. These two 
processes are triggered by different mechanisms originating from 



the dark-matter merger history. Bursts are assumed to follow merg- 
ers, while quiescent evolution is due to smooth accretion of gas 
into galaxies. The relative contribution of these two mechanisms to 
the total stellar mass formed is observationally only partially con- 
strained (for example, the total mass of stars within thin disks at 
low redshift might give an estimate for the total quiescent SF his- 
tory). Here, we investigate if a model where stars are made in bursts 
can be distinguished from the other more standard models. We will 
therefore construct an extreme model in which all stars are formed 
in SF bursts, hoping that it will reveal some of the basic differences 
between the merger and the quiescent mode of SF. These differ- 
ences may then be used in future work to distinguish between more 
reasonable models, where b ursts contribu t e muc h less to the overall 
SF history of the universe. iNoeske et al. I l l2007l) . for example, use 
the scatter in the SF rates of galaxies at z = 1 to infer that starburst 
in gas-rich major merger are not a dominant mode of star formation 
at these redshifts. 

We set the feedback efficiency and quiescent mode of SF 
to zero. Cooling efficiencies are adopted from the cold accretion 
model. The only freedom in this model is in fixing the dynamical- 
friction time, the burst duration, and burst efficiency. We find that a 
good match to observational data is obtained if we use 



Qdf 



13.6 



(23) 



for the dynamical-friction time (Eq.lI2t. We assume the burst de- 
pendence on time has a Gaussian shape, with a standard deviation 
of 



•^burst 



t 



13.6 



Gyr. 



(24) 



The peak of the SF efficiency is defined to happen at 2crburst af- 
ter the galaxies coalesce. The reader is referred to section |23] for 
a discussion about uncertainties in the above recipes. Lastly, we 
modified the burst efficiency and use Ob ~ 0.8, Oc = 0.7 only for 
haloes below a mass of 10^^ h'^MQ.More massive haloes are not 
allowed to have any merger-induced bursts. 



5.6 Model V - shutdown by mergers 

Different scenarios for the build-up of red galaxies (the so- 
called 'red sequence') are still under debate. Four main chan- 
nels for galaxies to become red have been suggested: (i) 
major mergers can trigger bursts of SF, exhaust the gas 
reservoir, and trigger AGN activity jToomre & Toomrd 1 1 9721 : 
iMihos & HemquisllTTggi ; iHopkins et al.ll2008l) : (ii) above some 
halo mass cooling r ates may be decreased due to AGN feed- 
back or virial shocks ( Birnboim & Dekel 2003; Dekel & BirnboimI 
20061; ICattaneo et al.ll2006l; iKeres et al.. ,2005. ; .Bower et aL,2006l : 



Croton et alj |2006|; I Somerville et alj 120081) ; (iii) morphological 
quenching "l lMartig et al .1120091) (iv) environmental effects acting on 
satellite galaxies. 

Here we present a model in which major mergers completely 
shut off both cooling and star formation for all time thereafter, 
which could be explained by a combination of scenarios (i) and 
(iii) above. This is in contrast to the previous models in which SF 
and cooling is quenched above a given halo mass (see e.g. Fig.[6t 

This model is similar to the model by iHopkins et al.l ^20081) 
who have suggested that major mergers of gas-rich systems may 
lead to permanent quenching of star formation. These authors 
compared their results to the standard semi-analytical models (e.g 
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Figure 8. Stellar mass functions of galaxies at various redshifts, for our different models as indicated in the top left panel and as summarized in table[T] The 
observational data is plotted with gray symbols. At all redshifts hig her than zero we c onvolve t he model ste l lar m asses wit h a Gaussian error d istribution, 
with standard deviation of 0.25 dex. At z = we use observa tions bv lLi & White! i2009|. circles). [Baldrv et all i2008l crosses). IPanter et"al] j2007|. pluses). At 
high-2 we use the following observations: iBundv et alj j2006l z = . 75 - 1 . circles). Ifiorch et alj ilOaS. z = 0.8 — 1, cros sesx lp^re^ ^onz^lez et alj i2'0Q8l 
^ = 0.8 - 1, z = 1.6 - 2 z = 2.5 - 3, plu s signs). iFontana et al.l Jiool z = 0.8 - 1, z = 1.6 - 2, z = 2 - 3, sti ffs). lOrorv et alj 1 2004 z = 0.8 - 1, 
upward-pointing triangles). lbrorv et aljj200l z = 0.75 - 1.25, z = 1.75 - 2.25, z = 2.25 - 3, diamonds and squares). iMarchesini et alH2009l z = 1.3 - 2, 
z = 2 — 3, right-pointing triangles). Model stellar mass functions are plotted at z = 0, 1, 1.5, 2.5 according to the label on each panel. We treat the specific 
IMF chosen in each measurement as part of the observational 'uncertainty' and do not convert them into the same IMF. 



DLB07; ICattaneo et ai]|2006l : ISomerville eTaLll2008l) which basi- 
cally assume that star formation is quenched above some fixed halo 
mass. They claim that their model leads to an improved agreement 
with observations for (i) the passive fraction as a function of stellar 
mass and halo mass (ii) the number of massive and passive galax- 
ies at high redshift. Here we try out a similar model. We start from 
Model II (i.e. similar to DLB07), but assume that any galaxy un- 
dergoing a major merger (with a mass ratio below 5:1) has its cool- 
ing and star formation efficiency set to zero permanently. Not even 
merger-induced bursts are allowed anymore. This makes it possible 
to increase cooling rates substantially for massive haloes, as visible 
in Fig. [6] Regarding all other recipes, this model is equivalent to 
Model II. 



6 RESULTS 

In this section we discuss the results obtained from the five models 
presented in the previous section using the full Millennium simula- 
tion. When tuning the models we examine the same set of figures as 
seen here for each model run. In the first part of the tuning we try to 
match the stellar mass functions and the universal SF density plots. 
As a second priority we consider the distribution of SF rates, the 



cold gas fractions, and the passive fraction of galaxies at z = 0. It 
might be that our models depend strongly on the priorities we give 
different observational constraints. It might also be that within the 
assumptions of each model there are other possible models provid- 
ing equal or better results. The main goal in tuning these models is 
to point out the interplay between different recipes, and to establish 
a set of models for future studies (e.g., merger rates, bimodality in 
SF). We do not claim to provide an excellent match to the observa- 
tional results. However, the majority of our models give similarly 
good agreement with observations as other current SAMs. 



6.1 Stellar masses 

The stellar mass function is the most basic physical constraint 
on the population of galaxies. Observationally, measuring stel- 
lar masses for galaxies is not a trivial task. It involves various 
steps with non-negligible uncertainties. The transformation of pho- 
tometrical data to stellar masses, the assumed IMF, dust correc- 
tions and cosmic variance may all contribute to significant devia- 
tions between diff erent studies (see for example the discussion in 
iBaldrv et akhoOSl) . 

In Fig. [8] we plot stellar mass functions for all our models 
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Figure 9. The conditional stellai' mass functions, for a given FOF lialo mass 
at 2 = 0. Model results are plotted accordin g to line-styles which are given 
in table[T] The data from lYang et alj )2009l) is plotted in thick gray lines. In 
each panel we quote t he relevant range in halo mass in units of logarithm of 
h~^MQ according to lYang et alj )200^ . We transform these ranges to the 
halo mass used by the M illennium simulation according to equation 2 from 
IWeinmann et alj j200d) . 



Figure 10. The universal SF rate density. Model line styles are described 
in table [T] Gray shaded regions show observational data: th e upper region 
is taken from the compilation bvlHopkins & Be acomH2006l 3a confidence 
level); the lower region is taken from Wilkins et alj <2008l la confidence 
level) and corresponds to the universal SF rate density derived from the 
evolution of the stellar mass functions with z- 



against observations in various redshift bins. It can clearly be seen 
that all our models fit the stellar mass functions relatively accu- 
rately, and, most importantly all models have almost the same mass 
functions. This indicates that although the stellar mass function 
over time constrains the parameters of the models, it can be repro- 
duced rather easily with very different basic scenarios. There are 
few deviations between our model results and observational data, 
mainly at high z and for low mass galaxies. In addition, Models 
IV and V slightly overpredict the high mass end of the stellar mass 
function at z = 0. The overall consistency is surprising given the 
large differences between the models. 

The assumption about the shape of the stellar initial mass 
function (IMF) is an important ingredient in the observational anal- 
ysis. However, our model does not include an assumption about the 
IMF except for the relatively small dependence on the recycling 
factor, R. Consequently, we do not attempt to correct the data to 
the same IMF and we see it as part of the observational uncertainty. 
It might be that only parts of the data sets are fitted properly by 
our model, indicating that some specific IMF solutions are pre- 
ferred. We do not try to explore this issue further here. Changes 
in the IMF between different observational results might appear 
as inconsistencies within the data (compare the high-mass end at 
z = 0.8 — 1.2 versus z ~ 0). These issues are less important here, 
as we are mainly concerned with comparing our models to different 
observational predictions. 

The stellar mass function obtained from Model IV indicates a 
severe resolution problem below '--^ 5 x 10^ Mq. Only this model 
shows such a problem which is connected to the fact that contri- 
butions from mergers are much more sensitive to the merger-tree 
resolution than other processes. This indicates that the minimum 
galaxy mass which is properly resolved in our model strongly de- 
pends on the actual parameters being used. 

More detailed information on the stellar masses of galaxies is 
obtained by splitting the mass function into galaxies which live in- 
side different halo masses. In Fig. |9l we compare the conditional 



stellar mass function fo r a given ha l o mas s obtained with our mod- 
els to the one given by I Yang et all ( |2009|) . Agreement with obser- 
vations seems to be reasonable for all our models, except in the 
lowest halo mass bin, where our models predict a lower stellar 
mass to halo mass relation for the central galaxies. These devia- 
tions might be due to the limited accuracy to which the halo mass 
can be measured, o r due to the different cosmological parameters 
used by lYang et alj ( 12003) and by the Millennium simulation. We 
also note that the somewhat too high number of small mass satel- 
lites at z = may be correlated to the slight overprediction of the 
stellar mass functions at high-z. This is because satellite galaxies 
today were central galaxies at high-z and thus were dominating the 
stellar mass functions at high-z. 

6.2 Star formation rates 

In Fig. [To] we show the universal SF rate density as a function 
of redshift. Again, it is interesting that all our models seem to be 
relatively similar, and they all lie within the observational uncer- 
tainties. We show two kinds of observational r esults here. The first 
comes from directly measuring the SF rate jHopkins & Beacoliil 
[2OO6), and the second from differentiati ng the stellar mass den- 
sity of the universe with respect to time l lWilkins et alj|200^ . We 
have not managed to simultaneously reproduce the stellar mass 
functions over time and the high SF rate density found in di- 
rect measurements. This inconsistency was already noted by e.g. 
IHopkins & Beacom (2006); Wilkins et al. (2008). 

In Fig. [TT] we show the distribution of the logarithm of the 
specific SF rate (defined as the SF rate divided by the stellar mass; 
hereafter SSFR) as a function of stellar mass at z = 0. Here we 
start to see larger deviations between our different models, indicat- 
ing that this relation is much more constraining than the stellar mass 
functions or the SF rate density of the universe. Unfortunately, ob- 
servations are relative ly uncertain, espe cially for passive galaxies 
(see the discussion in ISalim et alJboOTh . Note that both observa- 
tional data sets use volume-corrected values. 



The Degeneracy of Galaxy Formation Models 1 5 




Figure 12. The probability distribution of specific SF rates (SSFR) for differe nt bins in stellar mass at z = 0. Average stellar mass in Log Mq is indicated 
at the top of each panel. Observational results from lSchiminovich et alj i2007h are shown in thick gray lines. For SSFR values below 10~'^'^ Gyr~^ we plot 
the observation in dashed lines in order to emphasize the fact that these estimates are highly uncertain. Results for the model galaxies are plotted according 
to table[T] We use a minimum SSFR value of 10~* for all the model galaxies. In order to emphasize the shape of the distributions we normalized all lines to 
represent the same number of galaxies within a given mass bin. 



Interestingly, all our models manage to reproduce the tilt of 
the star-forming sequence, which is not reproduced by other cur- 
rent S AMs we are aware of l lSomerville et alfcoOSljFontanot et alJ 
This is somewhat surprising, as our models were not specifi- 
cally tuned to reproduce this tilt. We suspect that the absence of the 
tilt in most standard SAMs is related to the inclusion of the ejec- 
tion phase, as our Model from section |4] which mimics DLB07 
does not reproduce the tilt either. We have checked that removing 
the process of ejection in Model results in the appearance of a tilt 
in this model, although the stellar mass function is no longer simi- 
lar to observations. In order to correct the stellar mass function we 
need to modify the cooling efficiencies, as is done in Model II. We 
plan to further explore the SSFR versus stellar mass diagram, and 
the origin of the tilt in the star-forming sequence, in a future study. 

There are a few differences seen in Fig. [TT] between the ob- 
served and model distributions which are somewhat artificial and 
do not represent important discrepancies. Uncertainties in measur- 
ing stellar masses and SSFR can be modeled by multiplying our 
model results by a random error distribution. This will make the 
2-dimensional distribution wider, and will probably better fit the 
observations results. In addition, we do not allow SSFR to go be- 
low 10"* Gyr"^ in the dia gram. This results in a strong artificial 
peak at this SSFR value. Such a peak is not seen in the observa- 
tional data, where passive galaxies may have SSFR values smeared 
up to few times lO""' Gyr^^. 



In Fig.[T2]we show the distribution of SSFR for all our mod- 
els, in 9 different stellar mass bins, at z = 0. This figure shows that 
although the general tilt seen in Fig.[TT]is reproduced, for low stel- 
lar masses below 10^° Mq our model star-forming galaxies have 
lower SSFR than the observed ones with a difference of ~ 0.3 
dex. The SSFR values for passive galaxies (lower than ~ 10^^ 
Gyr~^) are highly unconstrained by the observations. However, it 
is interesting that each model shows a very unique behaviour in this 
regime. Once observations will be able to provide more accurate 
estimates of SF rates for passive galaxies, this information may be 
useful in constraining galaxy formation models. In Ap pendix B we 
show a similar figure, but comparing to the results of ISalim et al.l 
( l200'7h . 

In Fig. [13] we compare the fraction of passive galaxies as a 
function of stellar mass in our five different models. Passive galax- 
ies are defined according to the line shown in panel I of Fig. [TT] 
This line was drawn to roughly divide the population of passive 
versus active galaxies, and it corresponds to 

log(SSFRpassivc) = -0.48 log JTlstar + 2.88 , (25) 

where jristar is in units of A/q . While this observational quantity is 
rather uncertain, which is reflected in the al ready quite large d iffer- 
ence between passive frac tions derived by ISalim et aTl ( l2007h and 
ISchiminovich et all ( l2007b (which are both based on the UV-data 
of the same set of galaxies, using different methodologies), it could 
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Figure 11. The two-dimensional distribution of specific SF rate (SSFR) 
versu s stellar mass at z = 0. Ob servational data were graphically extracted 
from lSchiminovich et al.l j2007t) and are plotted in gray lines. Model re- 
sults are pl otted in solid thick l i nes. T he data from Salim et a l. ([2007;) is 
plotted over lSchiminovich et alj j2007h in the bottom right panel, in order 
to demonstrate the im portance of observa tional uncertainties (we slightly 
smooth the data from ISalim et alj i2007h in order to make the contours 
readable, this does not change the contour average location). In all pan- 
els contours show the regions which encompass 38%, 68%, 87% and 95% 
of galaxies. We use a minimum SSFR value of 10^** for all galaxies in this 
plot, as observations are not sensitive to lower SSFR. We define galaxies as 
being 'passive' if their SSFR falls below the dashed line shown in panel I. 
This line is a rough estimate drawn by eye, and is given in Eg. 1251 

be powerful to constrain SAMs. In all our models, except Model 
IV, the increase of the passive fraction as a function of stellar mass 
is too steep. This is probably due to the fact that all galaxies in 
high mass haloes are quenched in our models in the same way. At 
low stellar masses, on the other hand, all our models overproduce 
the fraction of passive galaxies significantly. This might be a result 
of the difficulty of detecting low mass red galaxies in the obser- 
vational data, or the specific treatment of satellite galaxies in our 
models. We note that a similar trend is found in the DLB07 model, 
shown in Fig.|4] 

6.3 Cold gas masses 

In Fig. [141 we show the cold gas mass function at z = for our 
different models. This observation provides important additional 
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Figure 13. The fraction of galaxies which are passive out of all the galax- 
ies within a stellar mass bin at j = 0. Passive galaxies are defined to have 
SSFR lower than the dashed line in Fig.[TT] panel I (see also Eq. l25t. Obser- 
vation al data is plotted in shaded g ray region and is taken from lSalim et alj 
l'2007*): ISchiminovich et alj i2007t) . Results from our various models are 
plotted according to the usual line-types. 

information about galaxy formation which can potentially be used 
to discriminate between models. Unfortunately, observational con- 
straints are still uncertain as they only measure the H I gas content. 
In order to obtain the mass function of the total cold gas, the mass 
of molecular gas n eeds to be estimated. This is done here using the 
model results from lObreschkow & Rawlingsl j2009h . Clearly, both 
the zero feedback, and the "shutdown after mergers" models tend 
to overproduce the number of objects containing large amounts of 
cold gas. In both models, a mechanism like for example AGN feed- 
back that heats up cold gas above some halo mass, or after a ma- 
jor merger, could solve this problem; we have not included such a 
mechanism for simplicity. 

Average gas fractions as a function of stellar mass are shown 
in Fig . [TS] along with the observational results from lCatinella et al] 
( 120091) . Gas fractions can help us to break the degeneracies between 
cooling and star formation efficiencies. Clearly, the gas fractions 
are much too high in the "shutdown after merger" model; how- 
ever, observations are still too uncertain to distinguish between our 
other scenarios. Future, more detailed observations of the cold gas 
fraction including both H I and molecular gas will therefore be an 
extremely important test for SAMs. 



7 SUMMARY AND DISCUSSION 

In this paper we develop a new formalism for the formation and 
evolution of galaxies within a hierarchical universe. Our approach 
is similar to semi-analytical models (SAMs), although we try to be 
more simple and schematic in order to encompass a large range of 
possible models. 

We claim that a significant step towards more simple, general, 
and transparent models is achieved by parameterizing the basic pro- 
cesses of galaxy formation as functions of host halo mass and red- 
shift alone. This language enables an easy and compact summary 
of the model, and results in a well defined set of differential equa- 
tions. Once each recipe is just a function of halo mass and redshift 
it is also simple to check the dependencies between recipes, and to 



The Degeneracy of Galaxy Formation Models 1 7 




9.5 10 10.5 11 11.5 

Log[ mcoid/A/g ] 



Figure 14. Cold gas mass func tion at ; = 0. Obs ervational data for the 
H I atomic gas are taken from IZwaan etal and define the lower 

part of the gray shaded region. In order to estimate the total amount of 
cold gas (atomic and molecula r) we use the model and predictions of 
lObreschkow & Rawling j i2009l) plotted as the upper part of the shaded re- 
gion. Model results are plotted according to table [T] 
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Figure 15. Average cold gas fraction, mt-oid/oistar as a function of stel- 
lar mass at ; = 0. In order to estimate the observational constraints we 
use preliminary resu lts from the GALEX Arecibo SDSS survey (GASS, 
ICatinella et'Zll2O09h . These are estimates for the average H I content for a 
complete sample of galaxies, and thus define the lower limit for the cold gas 
in our mode l (atomic and molecula r gas). Upper limits are set by the rough 
estimates of lBothwell et al.l j2009l) to the H I/H2 mass ratio and are highly 
uncertain. 



determine those that will result in a population of galaxies which 

matches the observational constraints. 

W e compare our approach to the SAM oflPe Lucia & BlaizotI 
( l2007h and show that also within this specific model, the recipes 
are effectively a function of halo mass and redshift. The only recipe 
which deviates from this approximation to some degree is the ra- 
diative gas cooling. However, for the range where the deviations are 
important (i.e. small mass haloes), it is still not clear how accurate 
the cooling approach adopted by this SAM is. Using the median 



values per halo mass and redshift for each recipe calculated from 
the original SAM, our model can produce a very similar population 
of galaxies. This demonstrates that forcing the physical recipes to 
depend only on halo mass and redshift does not affect the results of 
the SAM substantially, at least for the properties examined here. 

The general language we develop here might be useful for 
comparing the various SAMs currently in use and for highlighting 
differences between models. It can also serve as a tool for com- 
paring hydrodynamical simulations against SAMs. For example, 
it might be possible to obtain average feedback efficiencies as a 
function of halo mass and redshift using detailed simulations, and 
then to insert this information into the SAMs. This simple language 
might help to improve the communication between the different 
methodologies. 

The approach adopted here is very flexible as we can allow 
any efficiency value for a given host halo mass and redshift. We 
use this flexibility to explore very different scenarios of galaxy 
formation. We present on e standard model, which is based on 
iDe Lucia & Blaizo^jlOOTh but without an ejected phase or a thresh- 
old mass for star formation. We then introduce four scenarios, each 
characterized by a unique feature: zero feedback, cold-accretion in 
low mass haloes, stars formed only in merger-induced bursts, and 
shutdown of star-formation after mergers. In each model we tune 
the other processes so that the resulting galaxies match observa- 
tions. We do not argue that all these models are fully plausible, but 
show how different processes like feedback, cooling, and merger- 
induced bursts can compensate for each other. This also provides 
some insight into the range of efficiencies allowed by observations 
of the global properties of the galaxy population over time. For 
example, with our Model I, we are able to put a lower limit on 
the allowed cooling efficiencies given the general properties of the 
galaxy population over time. 

We test the five different models presented here against vari- 
ous observations. We show that all of them reproduce the observed 
stellar mass functions at all redshifts reasonably well. We explore 
inconsistencies related to the universal SF rate density, and show 
the level of agreement that can be achieved between these two ob- 
servations. Surprisingly, we find that all our five models reproduce 
the observed tilt in the relation of specific SF rates versus stellar 
mass, unlike other c urrent SAMs (see e.g. ISomerville et aklllOOSl : 
iFontanot et al This is likely related to the way that ejection 

and cooling is implemented in current SAMs. We plan to investi- 
gate this issue in future work. 

It is still difficult to assess what it really means if a given 
SAM manages to reproduce observations, as degeneracies are not 
well understood. How can we know whether or not a given SAM 
presents a relatively unique solution to the problem of galaxy for- 
mation and evolution? We try to improve on these limitations by 
presenting a set of models which are very different from each other. 
This can help us revisit basic issues of galaxy formation models. 
For example, we plan to investigate how galaxy merger-rates be- 
have in each model, what causes the distinction between red and 
blue galaxies, and what shapes the stellar mass function. 

We use the set of models developed here to investigate which 
observational quantity will break the degeneracy and allow us to 
differentiate between the models. The most promising such quan- 
tity is the full distribution of SF rates in different stellar mass bins. 
Specifically, SF rates for passive galaxies can give us information 
on the physics of SF although they do not affect stellar mass func- 
tion, or the cosmic SF density. On the other hand, at low SF rates 
it might be that the processes which regulate SF are very different 
from the main mode of SF. We find that it is surprisingly difficult 
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for different models to reproduce the passive fractions of galaxies 
as a function of stellar mass, which indicates that this observations 
may also be useful for constraining the models. In addition, cold 
gas fractions might be useful for discriminating between different 
models which all produce a similar galaxy population in terms of 
stellar mass and SF. Unfortunately, up to now published datasets do 
not take into account contributions from the atomic and molecular 
gas components (for estimates of gas mass ra tios see for example 
iBothwell et aLll2009l : lObreschkow et al.ll2009h . 

The formalism presented here can be viewed as an interme- 
diate step between the halo model and SAMs. This is because all 
recipes depend on halo mass only, but the galaxies are followed 
inside the complex structure of merger-trees, and their properties 
will depend strongly on the halo formation history. Consequently, 
galaxies which live inside haloes of the same mass might have 
very different SF histories. Similarly to the halo model, our model 
does not need to be parameterized a-priori, and the functional de- 
pendence of the recipes is obtained by matching the observations. 
However, unlike in the halo model, the ingredients we match (effi- 
ciencies of cooling, SF, and feedback) are directly connected to the 
underlying physical processes, and we can follow galaxies in de- 
tail through time. This level of modelling is more complex than the 
halo model, demands more computational time, and involves more 
assumptions. 

The current version of our model does not include a compu- 
tation of luminosity and color. This limits potential comparisons 
between our model galaxies and observations. For example, other 
SAMs usually compare the model against luminosity functions, 
color-magnitude diagrams, and mass-metallicity relations. On the 
other hand, comparing to these observations will require additional 
model ingredients like dust obscuration, metallicity enrichment, 
and the initial stellar mass function (IMF). It seems that from 
the above additional observational measurements, only the mass- 
metallicity relation is adding a new constraint to our model. How- 
ever, the number of ingredients that would need to be added to the 
model is quite large, so we suspect that the level of degeneracy will 
only increase. An additional shortcoming of our model is that it 
does not include any morphological information on galaxies. Since 
the establishment of the Hubble sequence it is well known that cor- 
relations between morphologies an d SF rates do exist (see a recent 
work bv ISchiminovich et alj|2007l) . However, it is unclear if this 
correlation is a consequence of the underlying physics, or if it plays 
an active role in shapi ng the SF histories of galaxies as suggested 
bv lMartiget"ai] l l2009l) . We plan to examine galaxy morphology in 
a future work. 

We do not attempt to provide improved recipes for SF, cooling 
or feedback, even though finding such recipes, which are physically 
motivated as well as in agreement with observations both on global 
and local scales, is one of the ultimate goals of semi-analytical 
modelling. We however believe that our method can help finding 
such recipes, and studying the interconnection between them, by 
opening up the allowed parameter space. 

The usual philosophy behind SAMs is that assumptions on 
the relevant physical processes are made, and it is then checked 
whether the properties of observed galaxies can be reproduced 
by tuning a certain set of parameters connected with these phys- 
ical processes. We try to solve the inverse problem: how the ob- 
served population of galaxies can constrain the physical mecha- 
nisms which drive galaxy formation. The outcome of our models 
are average efficiencies per halo mass and redshift, which need a 
further interpretation in order to have a physical meaning. For ex- 
ample, our different results for the cooling efficiencies need to be 



confronted with physical models of cooling, in order to find a sce- 
nario which is able to predict such deviations from the standard 
model. It might be that deviations due to e.g. metallicity variations 
and gas density profiles would not be able to give the required de- 
pendencies on halo mass and redshift. This dependence is more im- 
portant than our results on the absolute value of the same quantity, 
which we have shown is only poorly constrained by observations. 
Lastly, the fact that our recipes do not rely on detailed physical 
assumptions means that they generally should not be extrapolated 
to ranges in halo mass and redshift for which they have not been 
tuned. Such problems may also occur in standard SAMs, although 
in these cases some guidance for the extrapolation of each recipe is 
given. 

We note that the cosmological model used here is slightly 
different in it s para meters from the most recent estimate by 
iKomatsu et al. I 1 I2OO9I) . It might be possible to scale the merger- 
trees from the Millennium cosmology into the rece nt cosmology, 
by cha nging the time variable as was suggested bv iNeistein et al.l 
( l2009l) . If scaling of the subhalo merger-trees will only require 
changing the time variable, then it might be possible to scale our 
galaxy formation models by simply transforming only the depen- 
dence of the recipes on time. This s hould leave the result s of the 
models unchanged. However, while INeistein et al. I ( l2009l) found 
that time scaling should work well for FOF merger-trees, we use 
merger-trees based on subhaloes in this work, which might scale 
somewhat differently. 

We hope that the models developed in this work will help 
to interpret both observational data and different galaxy formation 
models. To serve this goal we provide a public online access to cat- 
alogs of our model galaxies. Our online web pag^fl is also able to 
run various galaxy formation models with user-defined parameters 
over a box size of 62.5 Mpc ^ . This volume represents the results 
obtained from the full Millennium simulation with a sufficient ac- 
curacy for many applications. We allow users to upload any kind 
of recipe which depends on halo mass and redshift. Results of the 
model are obtained within a few seconds and can be downloaded 
for further analysis. 
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APPENDIX A: EFFICIENCIES VALUES 

In tables|2]-|7]we provide numerical values for the cooling and SF 
efficiencies used by our models. We use bilinear interpolation to 
sample the values into a fine grid which is then used by our code. 



APPENDIX B: ADDITIONAL DATA - SPECIFIC SF RATES 

In this section we provide a compa rison of our model s SF rates at 
z = to the observational study bv iSalim et al. U2007i) . 
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Figure 16. The prob ability distribution of specific SF rates for different bins in stellai' mass at i = 0. The figure is similar to Fig. [T2]but it compares to the 
observational data of lSaUm et alj j2007h . 



Table 3. Same as table |2] but for Model II. 



Halo 


t = 0.80 


2.24 


3.38 


5.97 


10.27 


13.58 


mass 


z. = 7 


3.0 


2.0 


1.0 


0.3 





10.25 


-2.00 


-2.00 


-2.20 


-2.20 


-2.40 


-2.60 


10.75 


-1.20 


-1.20 


-1.40 


-1.60 


-2.20 


-2.40 


11.00 


-0.40 


-0.60 


-1.10 


-1.30 


-1.80 


-2.10 


11.25 


-0.10 


-0.40 


-0.70 


-0.80 


-1.30 


-1.80 


11.50 


0.50 


0.20 


-0.20 


-0.70 


-1.00 


-1.50 


11.75 


0.70 


0.40 


-0.10 


-0.50 


-1.00 


-1.40 


12.00 


0.80 


0.47 


0.10 


-0.50 


-1.00 


-1.50 


12.15 


0.80 


0.47 


0.10 


-0.50 


-1.00 


-2.50 


12.30 


0.80 


0.40 


-0.20 


-2.00 


-4.00 


-4.00 


12.50 


0.80 


0.20 


-2.00 


-4.00 


-4.00 


-4.00 



Table 5. Same as table |2] but for Model V. 



Halo 


t = 0.80 


2.24 


3.38 


5.97 


10.27 


13.58 


mass 


z = 7 


3.0 


2.0 


1.0 


0.3 





10.25 


-2.00 


-2.00 


-2.20 


-2.20 


-2.40 


-2.60 


10.75 


-1.20 


-1.20 


-1.40 


-1.60 


-2.20 


-2.40 


11.00 


-0.40 


-0.60 


-1.10 


-1.30 


-1.80 


-2.10 


11.25 


-0.10 


-0.40 


-0.70 


-0.80 


-1.30 


-1.80 


11.50 


0.50 


0.20 


-0.20 


-0.70 


-1.00 


-1.50 


11.75 


0.70 


0.40 


-0.10 


-0.50 


-1.00 


-1.40 


12.00 


0.80 


0.47 


0.10 


-0.50 


-1.00 


-1.50 


12.15 


0.80 


0.47 


0.10 


-0.50 


-1.00 


-1.50 


12.30 


0.80 


0.40 


-0.20 


-1.50 


-1.50 


-1.50 


12.50 


0.80 


0.20 


-1.50 


-1.50 


-1.50 


-1.50 



Table 4. Same as table|2] but for Models III & IV. 



Halo 

mass 


t = 0.80 

z = 7 


2.24 
3.0 


3.38 
2.0 


5.97 
1.0 


10.27 
0.3 


13.58 



10.25 


0.92 


0.47 


0.29 


0.05 


-0.19 


-0.31 


10.75 


0.92 


0.47 


0.29 


0.05 


-0.19 


-0.31 


11.00 


0.92 


0.47 


0.29 


0.05 


-0.19 


-0.31 


11.25 


0.92 


0.47 


0.29 


0.05 


-0.19 


-0.80 


11.50 


0.92 


0.47 


0.29 


-0.50 


-0.70 


-2.00 


11.75 


0.92 


-0.10 


-0.20 


-1.00 


-1.50 


-4.00 


12.00 


0.50 


-0.50 


-0.70 


-4.00 


-4.00 


-4.00 


12.15 


-1.00 


-1.00 


-1.00 


-4.00 


-4.00 


-4.00 


12.30 


-4.00 


-4.00 


-4.00 


-4.00 


-4.00 


-4.00 


12.50 


-4.00 


-4.00 


-4.00 


-4.00 


-4.00 


-4.00 



Table 6. Same as table |2] but for the model DLB07. Here we sample only 
few points out of the data presented in Fig.|6] 



Halo 


t = 0.80 


2.24 


3.38 


5.97 


10.27 


13.1 


mass 


z = 7 


3.0 


2.0 


1.0 


0.3 





10.25 


1.51 


0.98 


0.67 


-0.10 


-0.36 


-0.43 


10.75 


1.29 


0.83 


0.69 


-0.06 


-0.38 


-0.53 


11.00 


1.15 


0.35 


0.19 


0.53 


-0.32 


-0.49 


11.25 


0.76 


0.33 


0.17 


-0.06 


-0.28 


-0.44 


11.50 


0.69 


0.25 


0.09 


-0.10 


-0.31 


-0.37 


11.75 


0.58 


0.21 


-0.00 


-0.26 


-0.43 


-0.50 


12.00 


-2.29 


0.03 


-0.10 


-0.45 


-0.78 


-0.96 


12.25 


-1.05 


-0.28 


-0.54 


-0.89 


-4.00 


-4.00 


12.50 


-2.39 


-0.58 


-1.33 


-4.00 


-4.00 


-4.00 
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Table 7. Values of SF efficiencies, log /g, where /s is in units of Gyr~^. 
The values shown here ai'e for Model I, and ai'e plotted in Fig.|2] Halo mass 
is in units of H^^AIq and is shown at the left column, time is in Gyr. 



Halo 
mass 


t = 0.80 
z = 7 


2.24 
3.0 


3.38 
2.0 


5.97 
1.0 


10.27 
0.3 


13.58 



10.25 


1.00 


0.70 


0.50 


0.10 


-0.90 


-0.90 


10.75 


1.00 


0.70 


0.50 


0.10 


-0.90 


-0.90 


11.00 


1.00 


0.70 


0.60 


0.20 


-0.50 


-0.70 


11.25 


1.00 


0.70 


0.70 


0.30 


-0.20 


-0.70 


11.50 


1.00 


0.75 


0.80 


0.40 


-0.20 


-0.80 


11.75 


1.00 


0.75 


0.70 


0.20 


-0.50 


-0.90 


12.00 


1.00 


0.75 


0.70 


-0.70 


-1.80 


-2.50 


12.25 


1.00 


0.75 


-0.10 


-1.00 


-2.00 


-2.70 


12.50 


1.00 


0.75 


-0.70 


-2.50 


-3.00 


-3.00 



